233 research outputs found

Differential expression analysis for sequence count data

Author: A Agresti
A Mortazavi
AC Cameron
AM Smith
AS Morrissy
B Langmead
C Loader
CI Bliss
DD Licatalosi
G Robertson
GK Smyth
I Lönnstedt
J Bullard
JC Marioni
JF Lawless
JS Bloom
K Saha
L Wang
L Whitaker
M Kasowski
MD Robinson
P Engström
P McCullagh
RC Gentleman
Simon Anders
SJ Clark
U Nagalakshmi
Wolfgang Huber
Y Benjamini
Publication date: 01/01/2010

Motivation: High-throughput nucleotide sequencing provides quantitative readouts in assays for RNA expression (RNA-Seq), protein-DNA binding (ChIP-Seq) or cell counting (barcode sequencing). Statistical inference of differential signal in such data requires estimation of their variability throughout the dynamic range. When the number of replicates is small, error modelling is needed to achieve statistical power.

Results: We propose an error model that uses the negative binomial distribution, with variance and mean linked by local regression, to model the null distribution of the count data. The method controls type-I error and provides good detection power.

Availability: A free open-source R software package, DESeq, is available from the Bioconductor project and from http://www-huber.embl.de/users/anders/DESeq
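As a rough illustration of the error model named in the Results paragraph above, the sketch below evaluates a negative binomial probability from a given mean and variance. It is not the DESeq implementation: the function name `nb_pmf` is hypothetical, and the variance is passed in directly rather than estimated from the mean by local regression as the abstract describes.

```python
import math


def nb_pmf(k, mu, var):
    """Negative binomial probability of observing count k, given mean mu and variance var.

    Hypothetical helper for illustration only. The variance must exceed the mean
    (overdispersion); otherwise no negative binomial with these moments exists.
    """
    if var <= mu:
        raise ValueError("negative binomial requires variance > mean")
    # Moment matching: mean = r(1-p)/p and variance = r(1-p)/p^2
    p = mu / var
    r = mu * mu / (var - mu)
    # Work in log space so large counts do not overflow the gamma function
    log_pmf = (
        math.lgamma(k + r)
        - math.lgamma(k + 1)
        - math.lgamma(r)
        + r * math.log(p)
        + k * math.log(1.0 - p)
    )
    return math.exp(log_pmf)


# Example: a gene with mean count 10 and variance 30 (values chosen arbitrarily)
probabilities = [nb_pmf(k, 10.0, 30.0) for k in range(500)]
```

Parametrizing by mean and variance, rather than by the usual size and probability, mirrors how the abstract describes the model: the mean determines the variance, and together they fix the distribution.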

Crossref

Springer

Springer - Publisher Connector

PubMed Central

Institute of Mathematics AS CR, v. v. i.

Nature Precedings

Cytoplasmic Polyadenylation Element Binding Protein Deficiency Stimulates PTEN and Stat3 mRNA Translation and Induces Hepatic Insulin Resistance

Author: A Mora
AR Morris
B Feve
Bryan O'Sullivan-Murphy
D Burns
Dae Young Jung
DC Barnard
DD Licatalosi
DM Burns
Fumihiko Urano
H Herranz
HJ Kim
Hwi Jin Ko
I Groisman
Ilya M. Alexandrov
J Paris
J Tay
Jason K. Kim
JD Keene
JD Richter
JH Kim
JM Alarcon
Joel D. Richter
K Ueki
L Wu
LE Hake
Maria Ivshina
Mei Xu
MF White
N Liu
P Anderson
R Mendez
Randall Friedline
Rita Bortell
S Nottrott
S Wang
SE Kahn
T Boettger
T Maniatis
W Huang da
Wataru Ogawa
Yen-Tsung Huang
Publication venue: Public Library of Science
Publication date: 01/01/2012

The cytoplasmic polyadenylation element binding protein CPEB1 (CPEB) regulates germ cell development, synaptic plasticity, and cellular senescence. A microarray analysis of mRNAs regulated by CPEB unexpectedly showed that several encoded proteins are involved in insulin signaling. An investigation of Cpeb1 knockout mice revealed that the expression of two particular negative regulators of insulin action, PTEN and Stat3, was aberrantly increased. Insulin signaling to Akt was attenuated in livers of CPEB-deficient mice, suggesting that they might be defective in regulating glucose homeostasis. Indeed, when the Cpeb1 knockout mice were fed a high-fat diet, their livers became insulin-resistant. Analysis of HepG2 cells, a human liver cell line, depleted of CPEB demonstrated that this protein directly regulates the translation of PTEN and Stat3 mRNAs. Our results show that CPEB-regulated translation is a key process involved in insulin signaling.

CiteSeerX

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

eScholarship@UMMS

FigShare

FRA2A is a CGG repeat expansion associated with silencing of AFF3

Author: A Ruiz-Herrera
A Tukun
AJMH Verkerk
AR La Spada
B Winnepenninckx
C Jones
C Ma
C McMurray
CE Pearson
Chandra Sekhar Reddy Chilamakuri
Christopher E. Pearson
D Kumari
David I. Wilson
David R. FitzPatrick
DD Licatalosi
DD Rudnicki
DS Murthy
E de Graaff
E Steichen-Gersdorf
Edwin Reyniers
Eric Haan
Evelyn Douglas
G Annerén
Geert Vandeweyer
Geoffrey Thompson
Harris Morrison
Hemant Bengani
J Benítez
J Gécz
J Rainger
J Tost
Jacqueline Rainger
JE Parrish
JK Nancarrow
Jozef Gecz
K Debacker
K Gronskov
K Mondal
KE Davies
L Chakrabarti
Liesbeth Rooms
M Kato
M Melko
M Pieretti
M Wojciechowska
MA Costa Lima
MA Lancaster
Martin S. Taylor
MM Axford
O Britanova
R Illingworth
R Kodzius
R O'Rahilly
R Willemsen
R. Frank Kooy
RI Richards
SJL Knight
SL Nolin
Sofie Metsu
SS Chong
T Sarafidou
T Taki
T Zu
X Liao
Y Gu
Y Trottier
YaW Lin
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2014

Folate-sensitive fragile sites (FSFS) are a rare cytogenetically visible subset of dynamic mutations. Of the eight molecularly characterized FSFS, four are associated with intellectual disability (ID). Cytogenetic expression results from a CGG trinucleotide-repeat expansion mutation associated with local CpG hypermethylation and transcriptional silencing. The best studied is the FRAXA site in the FMR1 gene, where large expansions cause fragile X syndrome, the most common inherited ID syndrome. Here we studied three families with FRA2A expression at 2q11 associated with a wide spectrum of neurodevelopmental phenotypes. We identified a polymorphic CGG repeat in a conserved, brain-active alternative promoter of the AFF3 gene, an autosomal homolog of the X-linked AFF2/FMR2 gene; expansion of the AFF2 CGG repeat causes FRAXE ID. We found that FRA2A-expressing individuals have mosaic expansions of the AFF3 CGG repeat in the range of several hundred repeat units. Moreover, bisulfite sequencing and pyrosequencing both suggest AFF3 promoter hypermethylation. cSNP analysis demonstrates monoallelic expression of the AFF3 gene in FRA2A carriers, thus predicting that FRA2A expression results in functional haploinsufficiency for AFF3, at least in a subset of tissues. By whole-mount in situ hybridization, the mouse AFF3 ortholog shows strong regional expression in the developing brain, somites and limb buds in 9.5-12.5 dpc mouse embryos. Our data suggest that there may be an association between FRA2A and a delay in the acquisition of motor and language skills in the families studied here. However, additional cases are required to firmly establish a causal relationship.

Southampton (e-Prints Soton)

Crossref

Adelaide Research & Scholarship

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

FigShare

DGCR8 HITS-CLIP reveals novel functions for the Microprocessor

Author: A Shenoy
A Shiohama
Agata Stajuda
AM Denli
BN Davis
C Ender
D Tollervey
DD Licatalosi
DG Zisoulis
DP Bartel
E Bernstein
E Lund
Eduardo Eyras
FV Karginov
G Hutvagner
G Michlewski
Gracjan Michlewski
GS Slater
H Wu
J Han
J Konig
J Krol
J Ule
J Winter
Javier F Cáceres
JF Caceres
JM Pawlicki
JR Sanford
K Fenelon
KL Stark
M Faller
M Hafner
M Landthaler
M Morlando
Mireya Plass
MM Chong
MS Scott
MT Bohnsack
P Flicek
P Ji
PA Fujita
R Triboulet
R Yi
RI Gregory
RJ Taft
S Guil
S Kadener
Sara Macias
SW Chi
T Kiss
TJ Liang
Y Wang
Y Zeng
YT Lin
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2012

The Drosha-DGCR8 complex (Microprocessor) is required for microRNA (miRNA) biogenesis. DGCR8 recognizes the RNA substrate, whereas Drosha functions as the endonuclease. High-throughput sequencing and crosslinking immunoprecipitation (HITS-CLIP) was used to identify RNA targets of DGCR8 in human cells. Unexpectedly, miRNAs were not the most abundant targets. DGCR8-bound RNAs also comprised several hundred mRNAs as well as snoRNAs and long non-coding RNAs. We found that the Microprocessor controls the abundance of several mRNAs as well as of MALAT-1. By contrast, DGCR8-mediated cleavage of snoRNAs is independent of Drosha, suggesting the involvement of DGCR8 in cellular complexes with other endonucleases. Interestingly, binding of DGCR8 to cassette exons acts as a novel mechanism to regulate the relative abundance of alternatively spliced isoforms. Collectively, these data provide new insights into the complex role of DGCR8 in controlling the fate of several classes of RNAs.

Crossref

PubMed Central

Copenhagen University Research Information System

Edinburgh Research Explorer

UPF Digital Repository

Predicting RNA-Protein Interactions Using Only Sequence Information

Author: A Barkan
A Martínez-antonio
A Pacheco
AP Gerber
B Blencowe
BA Lewis
C Charon
D Ray
D Ursic
DD Licatalosi
DJ Hogan
Drena Dobbs
E Kaymak
H Hwang
HM Berman
I Breiman
I Sola
J Shen
JC Nacher
JD Keene
JG Lees
JR Sanford
KB Cook
L Pérez-Cano
M Bellucci
M Hafner
M Hall
M Khorshid
M Terribilini
MY Kim
N Mittal
NG Tsvetanova
P Baldi
P Zhou
S Kishore
S Lee
T Wu
T-Y Wang
TE Baroni
TI Lee
Usha K Muppirala
V Pancaldi
V Vapnik
Vasant G Honavar
VP Vidal
X Shao
X-W Chen
Y Wang
Z Li
Z-P Liu
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: RNA-protein interactions (RPIs) play important roles in a wide variety of cellular processes, ranging from transcriptional and post-transcriptional regulation of gene expression to host defense against pathogens. High-throughput experiments to identify RNA-protein interactions are beginning to provide valuable information about the complexity of RNA-protein interaction networks, but are expensive and time-consuming. Hence, there is a need for reliable computational methods for predicting RNA-protein interactions. Results: We propose RPISeq, a family of classifiers for predicting RNA-protein interactions using only sequence information. Given the sequences of an RNA and a protein as input, RPISeq predicts whether or not the RNA-protein pair interacts. The RNA sequence is encoded as a normalized vector of its ribonucleotide 4-mer composition, and the protein sequence is encoded as a normalized vector of its 3-mer composition, based on a 7-letter reduced alphabet representation. Two variants of RPISeq are presented: RPISeq-SVM, which uses a Support Vector Machine (SVM) classifier, and RPISeq-RF, which uses a Random Forest classifier. On two non-redundant benchmark datasets extracted from the Protein-RNA Interface Database (PRIDB), RPISeq achieved an AUC (Area Under the Receiver Operating Characteristic (ROC) curve) of 0.96 and 0.92. On a third dataset containing only mRNA-protein interactions, the performance of RPISeq was competitive with that of a published method that requires information regarding many different features (e.g., mRNA half-life, GO annotations) of the putative RNA and protein partners. In addition, RPISeq classifiers trained using the PRIDB data correctly predicted the majority (57-99%) of non-coding RNA-protein interactions in NPInter-derived networks from E. coli, S. cerevisiae, D. melanogaster, M. musculus, and H. sapiens. Conclusions: Our experiments with RPISeq demonstrate that RNA-protein interactions can be reliably predicted using only sequence-derived information. RPISeq offers an inexpensive method for computational construction of RNA-protein interaction networks, and should provide useful insights into the function of non-coding RNAs. RPISeq is freely available as a web-based server at http://pridb.gdcb.iastate.edu/RPISeq/
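The sequence encoding described in the abstract above can be sketched as follows. This is a minimal illustration, not the RPISeq code. The function names are hypothetical, the 7-class amino acid grouping is an assumed conjoint-triad-style grouping (the abstract does not list the classes), and dividing by the total k-mer count is only one possible reading of "normalized".

```python
from itertools import product

# Assumed 7-class grouping of the 20 amino acids. The abstract does not
# specify the classes, so treat this mapping as illustrative only.
AA_CLASSES = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
AA_TO_CLASS = {aa: str(i) for i, group in enumerate(AA_CLASSES) for aa in group}


def kmer_composition(seq, alphabet, k):
    """Return the k-mer frequency vector of seq over alphabet, scaled to sum to 1."""
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing symbols outside the alphabet
            counts[kmer] += 1
    total = sum(counts.values())
    return [counts[m] / total if total else 0.0 for m in kmers]


def encode_rna(rna):
    """Encode an RNA sequence as its 4-mer composition (4^4 = 256 features)."""
    return kmer_composition(rna.upper().replace("T", "U"), "ACGU", 4)


def encode_protein(protein):
    """Encode a protein as 3-mer composition over the 7-class alphabet (7^3 = 343 features)."""
    reduced = "".join(AA_TO_CLASS.get(aa, "X") for aa in protein.upper())
    return kmer_composition(reduced, "0123456", 3)


# A combined 599-dimensional feature vector for one RNA-protein pair
features = encode_rna("ACGUACGU") + encode_protein("MKVLA")
```

A vector built this way could be passed to any standard SVM or Random Forest implementation. Classifier choice and training are outside the scope of this sketch.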

Digital Repository @ Iowa State University (ISU)

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Detection and Removal of Biases in the Analysis of Next-Generation Sequencing Reads

Author: A Barski
A Valouev
AC Seila
AP Boyle
AP Fejes
AR Kornblihtt
B Li
C Wang
DD Licatalosi
E Hodges
ET Wang
G Hon
G Kunarso
GA Heap
Gil Ast
GW Muse
H Tilgner
I Listerman
IE Schor
J Li
J Rozowsky
J Zeitlinger
JC Dohm
JC Marioni
JF Degner
JW Brown
KD Hansen
KJ Gaulton
L Laurent
L Zhu
LJ Core
LW Hillier
M de la Mata
M Kircher
MJ Weber
ML Metzker
N Philippe
N Sela
N Spies
P Flicek
P Kolasinska-Zwierz
P Medvedev
Purification Lopez-Garcia
R Andersson
R Lister
R Morin
Ram Oren
S Griffiths-Jones
S Nahkuri
S Pepke
S Schwartz
Schraga Schwartz
T Kiss
TH Kim
W Chen
W Filipowicz
Y Gilad
Z Wang
Publication venue: Public Library of Science
Publication date: 01/01/2011

Since the emergence of next-generation sequencing (NGS) technologies, great effort has been put into the development of tools for analysis of the short reads. In parallel, knowledge is increasing regarding biases inherent in these technologies. Here we discuss four different biases we encountered while analyzing various Illumina datasets. These biases are due to both biological and statistical effects that in particular affect comparisons between different genomic regions. Specifically, we encountered biases pertaining to the distributions of nucleotides across sequencing cycles, to mappability, to contamination of pre-mRNA with mRNA, and to non-uniform hydrolysis of RNA. Most of these biases are not specific to one analyzed dataset, but are present across a variety of datasets and within a variety of genomic contexts. Importantly, some of these biases correlated in a highly significant manner with biological features, including transcript length, gene expression levels, conservation levels, and exon-intron architecture, which misleadingly increases the credibility of results that arise from them. We also demonstrate the relevance of these biases in the context of analyzing an NGS dataset mapping transcriptionally engaged RNA polymerase II (RNAPII) in the context of exon-intron architecture, and show that elimination of these biases is crucial for avoiding erroneous interpretation of the data. Collectively, our results highlight several important pitfalls, challenges and approaches in the analysis of NGS reads.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Mapping exosome-substrate interactions in vivo by UV cross-linking

Author: AC Tuck
C Delan-Forino
DD Licatalosi
EL Van Nostrand
F Ramírez
J Konig
JJ Tree
M Dodt
M Hafner
P Flicek
S Granneman
S Schneider
S Webb
V Libri
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

Crossref

Edinburgh Research Explorer

Evolutionary Constraint Helps Unmask a Splicing Regulatory Region in BRCA1 Exon 11

Author: Andrew G. L. Douglas
AR Grosso
C Yuli
CA Wilson
Claudia Tammaro
David I. Wilson
DD Licatalosi
Diana Baralle
F Pagani
F Piva
I Paz
J Lee
JR Thompson
JV Chamary
KA Dittmar
L Good
LD Hurst
Ludmila Prokunina-Olsson
M Lu
Michela Raponi
P de la Grange
PD Ryan
R Bachelier
RD Brandão
S Thakur
TI Orban
W Tang
Y Qin
ZE Sauna
Publication venue: Public Library of Science
Publication date: 16/05/2012

BACKGROUND: Alternative splicing across exon 11 produces several BRCA1 isoforms. Their proportion varies during the cell cycle, between tissues and in cancer, suggesting functional importance of BRCA1 splicing regulation around this exon. Although the regulatory elements driving exon 11 splicing have never been identified, a selective constraint against synonymous substitutions (silent nucleotide variations that do not alter the amino acid residue sequence) in a critical region of BRCA1 exon 11 has been reported to be associated with the necessity to maintain regulatory sequences. METHODOLOGY/PRINCIPAL FINDINGS: Here we have designed a specific minigene to investigate the possibility that this bias in synonymous codon usage reflects the need to preserve the BRCA1 alternative splicing program. We report that in-frame deletions and translationally silent nucleotide substitutions in the critical region affect splicing regulation of BRCA1 exon 11. CONCLUSIONS/SIGNIFICANCE: Using a hybrid minigene approach, we have experimentally validated the hypothesis that the need to maintain correct alternative splicing is a selective pressure against translationally silent sequence variations in the critical region of BRCA1 exon 11. Identification of the trans-acting factors involved in regulating exon 11 alternative splicing will be important in understanding BRCA1-associated tumorigenesis.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

CRISPR-assisted detection of RNA-protein interactions in living cells

Author: A Castello
D Kim
D Szklarczyk
DD Licatalosi
E Yildirim
EC Urdaneta
H Mira-Bontenbal
J Feng
J Hou
J Li
J Rozowsky
J Trendel
JE Wilusz
JN Hutchinson
JT Lee
K Kapeli
KM Chan
L Baranello
M Ghandi
M Kretz
M Ramanathan
MA Hakimi
P Shannon
R Breitling
R Lorenz
RML Queiroz
S Konermann
SA Myers
T Jegu
V Luga
W Huang da
Z Hou
Z Lu
Publication venue: Nat Methods
Publication date: 01/07/2020

We have developed a CRISPR-assisted RNA-protein interaction detection method (CARPID), which leverages CRISPR-CasRx-based RNA targeting and proximity labeling to identify binding proteins of specific long non-coding RNAs (lncRNAs) in the native cellular context. We applied CARPID to the nuclear lncRNA XIST, and it captured a list of known interacting proteins and multiple previously uncharacterized binding proteins. We generalized CARPID to explore binders of the lncRNAs DANCR and MALAT1, revealing the method's wide applicability in identifying RNA-binding proteins.

Crossref

Apollo (Cambridge)

miREE: miRNA recognition elements ensemble

Author: A Grimson
AA Khan
Andrea Acquaviva
B John
BP Lewis
C Barreau
CC Chang
D Bartel
D Gaidatzis
DD Licatalosi
DP Bartel
DW Thomson
Elisa Ficarra
Enrico Macii
F Xiao
GL Papadopoulos
H Mühlenbein
IL Hofacker
J Kruger
KC Miranda
M Hafner
M Kertesz
M Lindow
M Maragkakis
M Selbach
M Yousef
N Rajewsky
ND Mendes
O Saetrom
P Alexiou
Paula H Reyes-Herrera
PH Reyes-Herrera
RC Friedman
S Bandyopadhyay
S Lall
S Yoon
Simon
SK Kim
T Schmidt
V Chandra
X Wang
X Yan
Y Yang
Y Zhao
YW Chen
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Computational methods for microRNA target prediction are a fundamental step toward understanding the role of miRNAs in gene regulation, a key process in molecular biology. In this paper we present miREE, a novel microRNA target prediction tool. miREE is an ensemble of two parts with complementary but integrated roles in the prediction. The Ab-Initio module uses a genetic algorithm to generate a set of candidate sites on the basis of their microRNA-mRNA duplex stability properties. Then, a Support Vector Machine (SVM) learning module evaluates the impact of microRNA recognition elements on the target gene. As a result, the prediction takes into account information regarding both miRNA-target structural stability and accessibility. Results: The proposed method significantly improves on state-of-the-art prediction tools in terms of accuracy, with a better balance between specificity and sensitivity, as demonstrated by the experiments conducted on several large datasets across different species. miREE achieves this result by tackling two of the main challenges of current prediction tools: (1) the number of false positives from the Ab-Initio part, reduced thanks to the integration of a machine learning module, and (2) the specificity of the machine learning part, obtained through an innovative technique for generating rich and representative negative records. The validation was conducted on experimental datasets where the miRNA:mRNA interactions had been obtained through (1) direct validation, where even the binding site is provided, or (2) indirect validation, based on gene expression variations obtained from high-throughput experiments, where the specific interaction is not validated in detail and consequently the specific binding site is not provided.
Conclusions: The coupling of two parts, a sensitive Ab-Initio module and a selective machine learning part capable of recognizing the false positives, leads to an improved balance between sensitivity and specificity. miREE obtains a reasonable trade-off between filtering false positives and identifying targets. The miREE tool is available online at http://didattica-online.polito.it/eda/miREE/

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

PORTO Publications Open Repository TOrino